Data quality is a typical ethical requirement: we could never trust a piece of information if it did not have the typical data quality properties. Yet, we can also assert the opposite: that data should conform to a high ethical standard for it to be considered of good quality. Hence, the satisfaction of the ethical requirements is actually necessary to assert the quality of a dataset. In this article, we propose to introduce the most common ethical requirements as dimensions of quality, grouped in an Ethics Cluster. By now, we are more than aware that the Internet and the worldwide extent of the usage of information technology (IT) and computers have generated a plethora of datasets in all kinds of application areas. This data can correspond to useful information only if it is of good quality, and let us emphasize that it can be profitable to society only if its usage conforms to ethical principles. With a somehow more constructive and dynamic viewpoint, in this article, we discuss the dimensions of ethics in connection with the various phases of what we call the information extraction process, that is, the process of (i) identifying the data sources containing the information of interest, (ii) collecting the corresponding data and integrating them in order to produce a unique dataset, and (iii) applying the appropriate information extraction methods (from the application of a simple query up to a complex statistical, machine learning or data mining analysis). We thus advocate the need to extend the well-established data quality framework in [5] to incorporate ethics explicitly.

Ethical Dimensions for Data Quality / Firmani, Donatella; Tanca, Letizia; Torlone, Riccardo. - In: ACM JOURNAL OF DATA AND INFORMATION QUALITY. - ISSN 1936-1955. - 12:(2019), pp. 1-5. [10.1145/3362121]

Ethical Dimensions for Data Quality

Firmani Donatella;
2019

Abstract

Data quality is a typical ethical requirement: we could never trust a piece of information if it did not have the typical data quality properties. Yet, we can also assert the opposite: that data should conform to a high ethical standard for it to be considered of good quality. Hence, the satisfaction of the ethical requirements is actually necessary to assert the quality of a dataset. In this article, we propose to introduce the most common ethical requirements as dimensions of quality, grouped in an Ethics Cluster. By now, we are more than aware that the Internet and the worldwide extent of the usage of information technology (IT) and computers have generated a plethora of datasets in all kinds of application areas. This data can correspond to useful information only if it is of good quality, and let us emphasize that it can be profitable to society only if its usage conforms to ethical principles. With a somehow more constructive and dynamic viewpoint, in this article, we discuss the dimensions of ethics in connection with the various phases of what we call the information extraction process, that is, the process of (i) identifying the data sources containing the information of interest, (ii) collecting the corresponding data and integrating them in order to produce a unique dataset, and (iii) applying the appropriate information extraction methods (from the application of a simple query up to a complex statistical, machine learning or data mining analysis). We thus advocate the need to extend the well-established data quality framework in [5] to incorporate ethics explicitly.
2019
data integration; source selection; knowledge extraction
01 Pubblicazione su rivista::01a Articolo in rivista
Ethical Dimensions for Data Quality / Firmani, Donatella; Tanca, Letizia; Torlone, Riccardo. - In: ACM JOURNAL OF DATA AND INFORMATION QUALITY. - ISSN 1936-1955. - 12:(2019), pp. 1-5. [10.1145/3362121]
File allegati a questo prodotto
File Dimensione Formato  
Firmani_Ethical-dimensions_2019.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 92.04 kB
Formato Adobe PDF
92.04 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1638681
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 23
  • ???jsp.display-item.citation.isi??? 5
social impact